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We consider additive functionals of Markov processes in continuous time 
with general (metric) state spaces. We derive concentration bounds for 
their exponential moments and moments of finite order. Applications in- 
clude diffusions, interacting particle systems and random walks. In par- 
ticular, for the symmetric exclusion process we generalize large deviation 
bounds for occupation times to general local functions. The method is 
based on coupling estimates and not spectral theory, hence reversibility is 
not needed. We bound the exponential moments(or the moments of finite 
order) in terms of a so-called coupled function difference, which in turn is 
estimated using the generalized coupling time. Along the way we prove a 
general relation between the contractivity of the semigroup and bounds on 
the generalized coupling time. 
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1 Introduction 

The study of concentration properties of additive functionals of Markov processes is 
the subject of many recent pubhcations, see e.g. [5], g]. This subject is strongly 
connected to functional inequalities such as the Poincare and log-Sobolev inequality, 
as well as to the concentration of measure phenomenon [6]. In the present paper we 
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consider concentration properties of a general class of additive functionals of the form 
ft{Xt) dt in the context of continuous-time Markov processes on a Polish space. 
The simplest and classical case is where ft — f does not depend on time. However 
the fact that time-dependent functions ft are allowed can be a significant advantage 
in applications. 

Our approach is based on coupling ideas. More precisely, we estimate exponential 
moments or fc-th order moments using the so-called coupled function difference which 
is estimated in terms of a so-called generalized coupling time, a generalization of the 
concept used in [3] . Because of this approach no knowledge about a possible stationary 
distribution is required. 

Our method covers several cases such as diffusion processes, jump processes, random 
walks and interacting particle systems. The example of random walk shows that for 
unbounded state spaces, the concentration inequalities depend on which space the 
functions /* belong to. 

The main application to the exclusion process, which has slow relaxation to equilib- 
rium and therefore does not satisfy any functional inequality such as e.g. log-Sobolev 
(in infinite volume), shows the full power of the method. Besides, we give a one-to-one 
correspondence between the exponential contraction of the semigroup and the fact that 
the generalized coupling time is bounded by the metric. For discrete state spaces, this 
means that the semigroup is exponentially contracting if and only if the generalized 
coupling time is bounded. 

Our paper is organized as follows: in Section [2] we prove our concentration inequal- 
ities in the general context of a continuous-time Markov process on a metric space. 
We derive estimates for exponential moments and moments of finite order. In Section 
[3] we study the generalized coupling time and its relation to contractivity of the semi- 
group. Section[4]is devoted to examples. Section [5] deals with the symmetric exclusion 
process. 



2 Concentration inequalities 

Let X = {Xt)t>Q be a Feller process in the Polish state space E. Denote by its 
associated measure on the path space of cadlag trajectories D[o,oo[(£') started m x (z E 
and with 

df-^cr{X,;0<s<t}, t > 0, 

the canonical filtration. We denote by E the expectation with respect to the measure 
Pa;. For V a probability measure on E, we define := J^x v{dx), i.e. expectation in 
the process starting from u. The associated semigroup we denote by {St)t>o and with 
A its generator, both considered on a suitable space {B{E),C{E),Co{E), ...). 

The content of this section is to derive concentration inequalities for functionals of 
the form 

F{X) / ft{Xt)dt, ft-.E^R. (1) 
Jq 
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The most familiar case is when F is of the form 

/ f{Xt)dt, 
Jo 

i.e. ft = f for t <T and /t = for t > T. We first formulate conditions on the family 
of functions /( which we will need later. 

Definition 2.1. We say the family of functions {ft,t > 0} is k -regular for k G N, if: 

a) The ft are Borel measurable and t i— )• ft+s{Xs) is Lebesgue-integrable ¥x-a.s. for 
every x e E,t>0, andE^c \ ft+siXs) \ ds < oo; 

b) Ex sup I ft+s{Xs) I*" is well-defined and finite for t > 0, x & E arbitrary and 

0<s<e 

e > small enough; 

c ) There exists a function r : i? — > M and eo > such that for < e < eo and x €z E 

/"OO 

supE^; / I ft+^+s{Xs) - ft+siXs) I ds < er{x) 
t>o Jo 

andExr{Xe)'^ < oo. 

Remark If F{X) = f{Xt)dt, then E^^ sup \f{Xt)\'' < oo for some eo > 

0<t<T+eo 

implies conditions b) and c) of the fc-regularity. In condition b) the statement of 
well-definedness can be replaced by the existence of a measurable upper bound. 

The technique to obtain concentration inequalities for functionals of the form ([T|) is to 
use a telescoping approach where one conditions on ^t, i-e., where we average F(1C) 
under the knowledge of the path of the Markov process X up to time t. 

Definition 2.2. For < s < t, define the increments 

As,t ■.^E[F{X)\dt]-E[F{X)\ds] 
and the initial increment 

A,,o :=E[F(X)|;?o]-E,[F(X)], 
which depends on the initial distribution v. 

The basic property of the increments is the relation A^ ^, — As_t + At_i, for s <t < u. 
Also, we have 

E[F(X)|5t]-E,[F(X)] - A,^o + Ao,T, 

where we have to use A^^o to accommodate for the initial distribution u. To better 
work with the increment Ag^t, we will rewrite it in a more complicated but also more 
useful way. 
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Definition 2.3. Given the family of functions {ft : t > 0}, the coupled function 
difference is defined as 

/"OO 

^t{x,y) ■■= / Suft+u{x) ~ Suft+u{y)du. 

JQ 

Remark We call the coupled function difference because later we will see that we 
need estimates on | $f |, and for a coupling E of X starting in x and y we have the 
estimate 



$f(x,y) < / E^^yl ft+u{Xu) " ft+u{Yu) I du. 
Jo 

In the next lemma we express the increments Ag^t in terms of the coupled function 
difference $(. 

Lemma 2.4. 

Am fu{Xu) ' Su-sfu{Xs)du+[St-sMXt,-)]{Xs). 
Proof First, we note that 

E[F(X)|5t]= / fu{Xu)du+ / Su^tfu{Xt)du, 



and 



E[F(X)|5,]= / f^{X^)du+ / Su-sfu{Xs)du + 

Jo J s 

Hence, 

A,,t =E[F(X)|5t]-E[^^(X)|;?,] 



St~s / Su-tfu du 



{X, 



fu{Xu) - Su-sfu{Xs) du + St- 



Su~tfu{Xt) — Su-tfu du 



(Xs) 



fu{Xu) - Su-sfuiXs) du + [St-sMXt, ■)]iXs). 

□ 

The following lemma is crucial to obtain the concentration inequalities of Theorems 
12.61 and 12.91 below. It expresses conditional moments of the increments in terms of the 
coupled function difference. 

Lemma 2.5. Fix k (z k > 2. Assume that the family (ft) is k-regular and suppose 
that $((•, x)^ is in the domain of the generator A for all x E E. Then 

]im-E [Alt+. I ^t] = {A{M;Xt)'')){Xt). 
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Proof We will use the following elementary fact repetitively. For > 2, if | 6^ | < e6g 

—k 

and sup E 6^ < oo , then 

0<e<eo 



lim -¥.{a^ + h,f = lim -'Ea';. 



By Lemma 



e-i-O e 



(2) 



fu{Xu) - Su-tfuiXt) du + [S,^t+e{Xt+,, ■)]{Xt). 



First, we show that we can neglect the first term. Indeed, 

fu{Xu) - Su~tfu{Xt) du 



<e sup \ft+s{Xt+s)\+eEl^ sup 

0<s<e 0<s<e 



we can use part b) of the /c-regularity to apply fact ^ and get 



lim -E [A,\+, I ^t] = lim \[S,^t+,{Xt+,, •)]' (Xt) 



Next, by writing $t+c = $t + ($t+e — <&t), we will show that the difference can be 
neglected in the limit e — > 0. To this end, we observe that 



\<i>t+eix,y) ~^tix,y)\ < / 

Jo 



'■X I ft+e+u{Xu) — ft+u{Xu) I du 
+ / E^ I ft+e+u{Xu) - ft+u{Xu) I du. 

Jo 

Part c) of the fc— regularity condition allows us to invoke fact ^ again to obtain 



lim-E[At+J;?,] 



Ie 



[S^MXt+.rt (Xt) 



Finally, to replace Se^t{Xt+e, ■) by ^t{Xt+e, ■) by applying fact ^ for a third time, 
we estimate 



< 



< E, 



Su+eft+u+t{x) - Su+tft+u{x) du 



Suft+u{x)du 



\ft+u+t{Xu+e)-ft+u{Xu+t)\du + eEx sup /t+„(X„), 

0<u<e 



where parts b) and c) of the fc-regularity then provide the necessary estimates. Now, 
the desired result is immediately achieved: 



1 



1 



lim -E [A^,,+, I = lim - S,{<^t{-,Xt)t {Xt) 



e->0 e 



e-i-0 e 



A^t{:Xt)\Xt). 



□ 



5 



We can now state our first main theorem, which is a bound of the exponential moment 
of F{X) in terms of the coupled function difference $4. 

Theorem 2.6. Assume that for all fc e N, the ft are k-regular and $*(•, x)'^ E dom{A) 
for all X Cz E. Then, for any distributions fj, and v on E , 



logE, 



logE, 



<log(co)+ / sup^^(A(<ff(.,a;)))(x)di 



x£E 



k=2 

00 



>log(co)+ / inf VT:y(^(<ft'(-,a;)))(x)dt, 



fc=2 



where the influence of the distributions fi and v is only present in the factor 



Co 



,i^(*o(a:,-)) 



^,{dx) 



Remark li Ht : Ex E is an upper bound on | | and Ht{x, a;) = for all x G E, then 
the upper bound of the theorem remains valid if $t is replaced by Hf. In particular, 
if ft = flt<T, Ht := I $0 I lt<T serves as a good initial estimate to obtain the upper 
bound 



log I 



< log(co) + T sup ^ -A I $0 I' {■,x)ix). 



Further estimates on | $0 I specific to the particular process can then be used without 
the need to keep a dependence on t. 

Proof Define 

We see that for e > 0, 

^{t + e) - l'(t) = {e^'-°+'^°-*E [e'^*-'+= " 1 1 5?*]) 



= E^(, 



E [e^"^^~At,t+e~l\St]), 



where we used the fact that E [Aj^t+elS^t] = 0. Hence, using Lemma 12.51 we can 
calculate the derivative of 



/ 00 
= E^ ( e^*-+^"- ^iA{'ft{;Xtr)){Xt) 



k=2 



To get upper or lower bounds on ^f', we move the sum out of the expectation as a 
supremum or infimum. Just continuing with the upper bound, as the lower bound is 
analogue. 



00 

vI/'(0<*Wsup^-(A($^(.,x)))(x). 
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After dividing by ^'(t) and integrating, we get 

»T oo 

ln*(T)-lnvI/(0) < / sup^-(A($f(-,a:)))(a;)dt, 

Jo xGEt^kl 



k=2 



which leads to 



The value of cq = ^(0) = E^e'^* ° is obtained from the identity 



How the bound in Theorem 12.61 can be used to obtain a deviation probability in the 
most common case is shown by the following corollary. 

Corollary 2.7. Assume that F(X) = fi^t) dt, the conditions of Theorem \2.b\ are 

satisfied, and sup^.^^; ^ | $0 1*^ {■,x){x) < C1C2 for some ci, C2 > 0. Then, for any initial 
condition x Cz E, 



P^(F(X) - Ea;F(X) > x) < e ^^+37^ . 
Proof By Markov's inequality, 

P^(F(X) - E^F(X) > a;) < e ^e^-^t^^^^^^^^^'e"^^ 



where the last line is the result from Theorem l2.6l Through optimizing A, the exponent 
becomes 

^_(rci + ^)iog(-^ + i). 

C2 C2 1 C1C2 

To show that this term is less than „ ^ ^ , we first rewrite it as the following in- 
equality: 

2 C2 -4- — 

TciC2 Tci + ^ 

Through comparing the derivatives, one concludes that the left hand side is indeed 
bigger than the right hand side. □ 

In applications one tries to find good estimates of $t . When looking at the examples 
in SectionlH finding those estimates is where the actual work lies. In the case where the 
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functions ft are Lipschitz continuous with respect to a suitably chosen (semi)metric p, 
the problem can be reduced to questions about the generalized coupling time h, which 
is defined and discussed in detail in Section [3] In case that the exponential moment of 
F(X) — EF(X) does not exist or the bound obtained from Theorem 12.61 is not useful, 
we turn to moment bounds. This is the content of the next theorem. 

Lemma 2.8. Assume that the ft are 2-regular and ^1{-,x) is in the domain of the 
generator A. Then the predictable quadratic variation of the martingale (Ao,t)t>o is 



(Ao.),= f A<^li-,X,)iX,)ds. 
Jo 



Proof We have, using Lemma [2.51 for k — 2, 



dt 



(Ao, 



lim -E [A^tt^^ 



A^U;Xt)iXt). 



□ 



Theorem 2.9. Let the functions ft be 2-regular and in the domain of the 

generator A. Then 



(EJF(X)-E,F(X) <Cp 



A^^ti-,Xt){Xt)dt 



E, 



snp\ MXt,Xt^) 



(3a) 

(3b) 
(3c) 



oo. 



where the constant Cp only depends on p and behaves like p/logp as p 
Proof By the triangle inequality, 

(E^ I F(X) - E,F(X) < (E^ I Ao.oo + (E^ | A,,o ■ 

Since (Ao,t)t>o is a square integrable martingale starting at 0, a version of Rosenthal's 
ineaualitvf|10j. Theorem 1) implies 



(EJAo,T <Cp 



En sup |Ao,t-Ao,t- 



0<t<T 



Applying Lemma [2.8l to rewrite the predictable quadratic variation (Ao,.)t and Lemma 
l2.4l to rewrite At__t, we end with the first two terms of our claim after letting T — > c». 
The last term is just a different way of writing A^^o^ 



i/($o(x,-))r Kdx) 



□ 



8 



Let us discuss the meaning of the three terms appearing on the right hand side in 
Theorem 

a) The first term gives the contribution, typically of order T?, that one expects 
even in the simplest case of processes with independent increments. 

E.g. if fi is an invariant measure and -F(X) — fi^t) dt, then 

p 

In many cases (see examples below), / (^A^q{-,x){x)) ^ ^{dx) can be treated as 
a constant, i.e., not depending on T. There are however relevant examples where 
this factor blows up as T — >■ c». 

b) The second term measures rare events of possibly large jumps where it is very 
difficult to couple. If the process X has continuous paths, this term is not present. 
Usually this term is or bounded or is of lower order than the first term as T — > cxi. 

c) The third term has only the hidden time dependence of $o on T. It measures 
the intrinsic variation given the starting measures /i and v and it vanishes if and 
only a II = V — Sx- 

It is also interesting to note that the estimate is sharp for small T: If one chooses 
F(X) — ^ fi^t) dt and looks at the limit as T 0, the first two terms disappear 

and the third one becomes (/ | f{x) -— v{f) \^ ^{dx))^ , which is also the limit of the 
left hand side. 



3 Generalized coupling time 

In order to apply the results of Section [5] we need estimates on $(. We can obtain 
these if we know more about the coupling behaviour of the underlying process X. To 
characterize this coupling behaviour, we will look at how close we can get two versions 
of the process started at different points measured with respect to a distance. 

Let p : E X E ~^ [0, oo] be a lower semi-continuous semi-metric. With respect to 
this semi-metric, we define 

II / ll^.p := inf {r > | f{x) - f{y) < rp{x,y) yx,yeE}, 

the Lipschitz-seminorm of / corresponding to p. Now we introduce the main objects 
of study in this section. 

Definition 3.1. a) The optimal coupling distance at time t is defined as 
Pt{x,y) := inf p{x' ,y') TT{dx' dy'), 

TTe'0iS(5^St,dySt) J 

where the infimum ranges over the set of all possible couplings with marginals 
5xSt and SySt, i.e., the distribution of Xt started from x ory. 
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b) The generalized coupling time is defined as 



h{x,y) 



pt{x,y)dt. 



Now that we have introduced the generahzed couphng time, as first apphcation we 
obtain, using the remark following Theorem 12.61 



Corollary 3.2. Assume the functions ft are Lipschitz continuous with respect to a 
semi-metric p, and that the conditions of Theorem \2. 6\ hold true. Then 



E, 



^F(X)-E„_F(X) 



E ^ snp{Aih>'{-,xmx) 



where 



Co 



supll ft \\^-j,u{h{x,-)) 



tiidx), 



Ck = / sup II ft dt. 

Jo t>0 

In particular, if ft = f for t <T and ft = for t > T, then 



Ck<T\\f\ 



L ip ' 



Remark If h is an upper bound on the generalized coupling time h with /i(a;, x) = 0, 
then the result holds true with h replaced by h. 

Proposition 3.3. The optimal coupling distance pt has the dual formulation 

pt{x,y) = sup (Stfix) - Stf{y)). 
Il/lli,.p=i 

Proof By the Kantorovich- Rubinstein theorem ([H], Theorem 1.14), we have 



inf 



pdw^ sup / fd{SxSt)- / fd{SySt) 

il/iL.p=iL-' J 
= sup [{Stf)ix) - {Stf)iy)] . 



□ 



Also, it is easy to see that the semi-metric properties of p translate to pt and thereby 
to the generalized coupling time h. 

Proposition 3.4. Both the optimal coupling distance pt and the generalized coupling 
time h are semi-metrics. 
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Proof We only have to prove the semi-metric properties of pt, they translate naturally 
to h via integration. 

Obviously, pt{x,x) = and pt{x,y) — pt{y,x) is true for all x,y ^ E \>y definition 
of pt- For the triangle inequality, we use the dual representation: 

pt{x,y)^ sup {Stf{x) ~ Stfiy)) 
ll/IL.,=i 

= sup iStf{x)-Stfiz) + Stf{z)~Stfiy))<pt{x,z)+ptiy,z) 
ll/IL.p=i 

□ 

A first result is a simple estimate on the decay of the semigroup St in terms of the 
optimal coupling distance. 

Proposition 3.5. Let p be a stationary probability measure of the semigroup St- Then 

< II/IIl.p (/ Kdx) iyj p{dy)pt{x,y) ^ 

Remark When we choose the metric p to be the discrete metric Ix^y (a choice we 
can make even in a non-discrete setting), we can estimate pt{x, y) by ^x.y{T > t), the 
probability that the coupling time t — inf > | — X"^ Vs > t} is larger than t 
in an arbitrary coupling Px,y of the Markov process started in x and y. In this case, 
the result of Proposition 13.51 reads 

II Stf - pif) < II / ||_ (^J p{dx) (^j p{dy)%jT > ' , 

where || / H^^^^ = sup^ yifi^) ~ fiu)) is the oscillation norm. Note that this can also 
be gained from the well-known coupling inequality 

II 6xSt - SySt W^var ^ 2P,,;,(t > t). 

Proof of Proposition [3751 First, 

I Stf{x) - pif) I - I Stf{x) - piStf) I 

Kxf{Xt)~ I p{dy)Eyf{Yt) 

< f p{dy)\Exf{Xt)-Eyf{Yt)\ 



< / Kdy)\\f\\upPt{x.y)- 
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This estimate can be applied directly to get the result: 

II Stf - m(/) IIl.(^) - (/ Kdx) I SJ{x) - f^if) ' 

<II/IIl.p (/ f^idx) (^J fi{dy)pt{x,y) 



□ 



The above result did not use the semigroup property of St- When we use it we can 
improve estimates considerably. The price is that from now on, p has to be a metric, 
and this metric must be compatible with the Markov process, which we will define a 
little bit later under the notion of contraction with respect to this metric. The aim 
is to show how the uniform boundedness of the generalized coupling time implies an 
exponential decay of the semigroup (St) in the Lipschitz seminorm. To this end, we 
need the following lemma: 

Lemma 3.6. Under the condition that p is a metric, 

Proof By the representation of the optimal coupling distance in Proposition 

Pt{x,y) Stf (x) - Stf {y) 
sup — = sup sup 

p{x,y} ^^y||yj,^^^=i p{x,y) 



sup \\Stfh^,,„ = \\St _ 

□ 



iLip — II '^t \\Lip 

l/IL,„ = l 



Definition 3.7. We say that the process X acts as a contraction for the distance p if 
pt{x,y) < p{x,y) Vt>0, (4) 
or equivalently, 

\\Sth,^<i yt>o. 

This property is sufficient to show that the process is contracting the distance mono- 
tonely: 

Lemma 3.8. Assume that the process X acts as a contraction for the distance. Then 
Pt+s{x,y) < pt{x,y) yx,y e E,s,t>0. 
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Proof Using the dual representation, 

Pt+s{x,y)= sup [St+sfix) - St+sf{y)] 
ll/IL,p=i 

= sup [St{Ssf){x)-St{Ssf){y)]. 
II/IIl,p<i 

By our assumption, the set of functions / with || / jj^^^ < 1 are a subset of the set of 
functions / with || 5^/ < 1. Hence, 

Pt+s{x,y)< sup [St{Ssf){x) - St{Ssf){y)] 
f--\\Ssfh,^<i 

< sup [Stg{x) - Stg{y)] = Pt{x,y). 

Ilsll<i 

□ 

With this property in mind, we can show the main theorem of this section. 

Theorem 3.9. Assume that p is a metric and that the process X acts as a contraction 
for the distance. Then the fact that the generalized coupling time h is bounded by the 
metric p is equivalent to the fact that the semigroup {St) is exponentially contracting. 
More precisely, for a > 1 arbitrary, 

a) Vx,yeE: h{x,y)<Mp{x,y) ^ Vi > A/a : || 5* < 

b) \\STh,p<^ ^ yx,yeE: h{x,y)<f^,p{x,y). 
Proof a) For x,y £ E, set 



T,,y inf <! t > 



Pt{x,y) < -p{x,y) 
a 



Then, 

Mp{x,y) > h{x,y) ^ I Pt{x,y)dt> I pt{x,y)dt>T^^y-p{x,y). 



Therefore T^.y is bounded by Ma. By Lemma l5^ pt [x, y) < px^ ^ {x, y) for all t > T^.y. 
Hence pMa{x,y) < ^p{x,y) uniformly, which implies || Smo, W^ip < ^■ 
b) Since pt{x,y) < p{x,y) \\ St 11^,^, 

Kx,y)^ Pt{x,y)dt < p{x,y) WStW^.dt 
Jo Jo 

aT 

k=0 

□ 



< Pix,y)T^\\ St < '^I'jP^^^y^- 
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When we apply this theorem to an arbitrary Markov process where we use the discrete 
distance, we get the following corollary: 

Corollary 3.10. The following two statements are equivalent: 

a) The generalized coupling time with respect to the discrete metric p{x,y) = Ix^y 
is uniformly bounded, i.e. 

h{x,y)<M \fx,yeE; 

h) The semigroup is eventually contractive in the oscillation (semi)norm, i.e. \\ St < 
1 for some T > 0. 

Remark Theoreni l3.9l actually gives us more information, namely how the constants 
M and T can be related to each other. 

Proof Since obviously sup pt {x,y) < 1, the process X acts as a contraction for the 

discrete distance and the result follows from Theorem 13.91 where we also use the fact 
that in the case of the discrete metric, || • H^-^ = \\ ■ \\^^^. □ 

Since Theorem 13.91 part a) implies that || S't decays exponentially fast, it is of 
interest to get the best estimate on the speed of decay, which is the content of the 
following proposition: 

Proposition 3.11. Assume that p is a metric, the process X acts as a contraction for 
the distance and the generalized coupling time h satisfies h{x,y) < Mp{x,y). Then 

lim - log II StWr- < - — ■ 

Proof The proof uses the same structure as the proof of part a) in Theorem l3.9l First, 
fix e between and -jj . Then define 

T,,y = inf {<> I pt{x, y) < p[x, y)e-(^-^)*} . 
By our assumption, 

Mp{x,y)>h{x,y)>p{x,y)J e dt ^ M p{x,y) ^ _ . 

Since the fraction on the right hand side becomes bigger than 1 if Tx,y is too large, 
there exists an uniform upper bound T(e) on T-x^y. Hence, for all t > T{t), \\ St H^j^ < 
e~*-"S'~'^)*, which of course implies lim j \\ St < —jj + e. By sending e to 0, we 
finish our proof. □ 

Again, we apply this result to the discrete metric to see what it contains. 



14 



Corollary 3.12. Let F^^y be a coupling of the process X started in x resp. y, and de- 
note with T := inf > | = Vs > the coupling time. Set M :— sup K^.yT. 



Then 



x,yeE 



ln^ilog||^.IU<-^. 
Equivalently, for f G L°° , 

lmiilog||^J-M(/)ll.<-^, 
where /i is the unique stationary distribution of%. 

Remarks a) If the the Markov process X is also reversible, then the above result 
extends to L-'^ and hence to any L^, where the spectral gap is then also at least 
of size ^. 

b) As an additional consequence, when a Markov process can be uniformly coupled, 
i.e. sup E x.yT < M < oo for a coupling E , then there exists (a possibly 

x,yGE 

different) coupling Kx.y, so that sup E^.y^'^'^ < oo for all A < -jg. Note that 

x,y£E 

without Corollarv 13.121 this property is obvious only for Markovian couplings. 



4 Examples 

4.1 DifTusions with a strictly convex potential 

Let V he a, twice continuously differentiable function on the real line with V" > c > 
and f e^'^^'^^dx = Zy < oo. To the potential V is associated the Gibbs measure 

Zv 

and a Markovian diffusion 

dXt = -V'{Xt) + V2dWt 

with fly as reversible measure. 

To estimate the optimal coupling distance pt at time t(see Definition 13. ip . we couple 
two versions of the diffusion, started in x and Xf started in y, by using the same 
Brownian motion (Wt)t>Q- Then the difference process Xf — X^ is deterministic, x < y 
implies X^ < Xfl and by the convexity assumption 

d{xy - xn = ~{v\x\) - v'{xt)) < -~c{x\ - xn. 

Using Gronwall's Lemma, we obtain the estimate 
Pt{x,y) < Ix-yle^"* 
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on the optimal coupling distance. By integration, the generalized coupling time h has 
the estimate h{x,y) < -\x — y\. As a consequence, Proposition 13. 1 ll implies 



]im log \\Sth,p< 



Since the generator A of the diffusion is 
dx'^ dx ' 

we have 

Vc ' 7 ^ ^ [O, k> 2. 

Therefore, for / : M — > R be Lipschitz-continuous, we can use Corollary 13.21 to get the 
estimate 



Jo }(Xt)dt-Ti^^ j;^ f(Xt)dt 



< C^t,V2(i^ ' , (5) 



I / 1 



with the dependence on the distributions vi and V2 given by 

Remark a) An alternative proof that strict convexity is sufficient for ^ to be true 
can be found in [12]. A proof via the log-Sobolev inequality can be found in [6]. 
Hence the result is of no surprise, but the method of obtaining it is new. 

b) This example demonstrates nicely how in the case of diffusions the higher mo- 
ments of Ah^{-^x){x) can disappear because the generalized coupling time is 
bounded by a multiple of the initial distance. 

c) The generalization to higher dimensions under strict convexity is straightforward. 

4.2 Interacting particle systems 

Let E — {0, 1}^ be the state space of the interacting particle system with a generator 
L given by 

^/(^)=E E c(ry,x + A)[/(77-+^)-/(77)], 

where rj^ denotes the configuration rj with all spins in A flipped. This kind of parti- 
cle system is extensively treated in [7]. For / : £^ — > R, we denote with 5f{x) :— 
sup f{rf) — f{r]) the maximal influence of a single flip at site x, and with Sf = 

{6f{x))xeE the vector of all those influences. 

If there is a way to limit how flips in the configuration affect the system as time 
progresses, then we can obtain a concentration estimate. Again, denote with F{r].) = 
lo fivt) dt the additive functional of the function / and the particle system ij.. 
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Theorem 4.1. Assume there exists a family of operators At so that Ss^f < AfSf, and 
write 



G 



Jo 



Atdt, 



which is assumed to exist. Denote with 
Ck := sup ^ c(?7, a; + A) I A I*" 

the weighted maximal rate of spin flips. If \\ G \\p^2 ^ °° Z^*" some p>\, then for any 
f with 6f ^ £P and any initial condition rj <E E, 



Ck \\G\ 



fc=2 



k\ 



If additionally \\G\\^ < oo and \\\f\\\ := < oo, then for any two probability 

distributions vi, 1^2, 



Gill III/ III +rE 



Ck\\G\ 



ip->-2 



\Sf\ 



k=2 



k\ 



Applications of this Theorem are for example spin flip dynamics in the so-called 
M < e regime, where there exists an operator F with || F ||-^ = M, so that 

5s,f < e-*(-r)^/ 

holds. Since G = e-*^^-^) dt={e- T)-\ || G ||i < (e - M)-\ Hence || G W^^^ < 
(e — M)~^ for a first application of the Theorem. If the process is reversible as well, 
II G 11,^ = II G 111, and by Riesz-Thorin's Theorem, we have || G II2 < (e - hence 
we get the result for functions / with || (5/ II2 < 00. 

Another example is the exclusion process. As a single discrepancy is preserved and 
moves like a random walk, At{x, y) = pt{x,y), the transition probability of the random 
walk. In high dimensions, G{x,y) = pt{x,y) dt has bounded £^ — )• £^-norm: 

l|G||i^2= sup ^{^G{x,y)g{y)r 

llslli=l X y 

< sup 5]5^|5(y)|G(x,y)2<^G(x,0)2oo 

llslli^l X y X 



pOO pOO pOO pOO 

/ / Y,Pt{0,x)ps{0,x)dsdt= / / ps+t{0,0)dsdt 
Jo Jo ^ Jo Jo 



< 00 



in dimension 5 and higher. As the exclusion process switches two sites, Cfe < 2*^, and 
hence 



E^e^(''-)-^''^('' ) <exp 



2'= ||G||t^2lll / 



k=2 



l->2 
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However, this is only a quick result exploiting the strong diffusive behaviour in high 
dimensions. In the last section we will deal with the exclusion process in much more 
detail to obtain results for lower dimensions as well. 

Proof of Theorem 14.11 First, we notice that the coupled function difference $t for 
a single flip can be bounded by 

POO 



< / 5s,f{x)dt< / {At5f){x)dt 

JQ Jo 

< {GSf){x) 

uniformly in r/. To estimate the coupled function difference we telescope over single 
site flips, 

^1ir^^+^,rj)<\AfiiG6f)ix))', 
and therefore 

L^^{;V){V)^Y1 E c(r,,x + A)<I)f(r;-+^,,7) 

<E E c{v,x + A)\A\''iG6f)'^{x) 

<c,\\GSf\\l<ck\\GSf\\';<c,\\G\\';^,\\Sf\\'; 

Hence the first part is proven by applying these estimates to Theorem l2.6l for fixed and 
identical initial conditions. To prove the estimate for arbitrary initial distributions, 
we simply observe that, again by telescoping over single site fiips, 

Mv, < E ™p or , c) < E(^^/)(^) < II G 111 II / II 1 . 



□ 



4.3 Simple symmetric random walk 

The aim of this example is to show that we can get concentration estimates even if 
the process X - in this example a simple symmetric nearest neighbour random walk in 
Z''' - has no stationary distribution. We will consider three cases: / € £^{Z'^), 
and £°°(Z''), and F(X) = f{Xt)dt. To apply Theorem [2:61 our task is to estimate 
I ^t{x,y) I where y is a neighbour of x. We will denote with pt{x,z) the transition 
probability from a; to z in time t. We start with the estimate on the coupled function 
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difference 

\^t{x,y) I 



T-t 



E,f{X,)-EyfiX,)ds 

rT-t 

I /(^)(Ps(a;,^) -Ps(2/,^;))(is 

^0 



< 



z 



T-t 



Ps{x,z) -ps{y,z)ds 



Ps{x,z) - ps{y,z)ds 



Now, depending on the three cases of /, we proceed differently. First, let f G i^. Then, 



< 



isup 

z 

T 



/ Psix,z)-ps{y,z)ds 
Jo 

/ Ps{x,z) -ps{y,z)ds 
Jo 

1 / p,{0,0)-p,{y-x,0)ds<C,\\f\\^. 
Jo 



Since \ x — y\ = 1, the constant Ci = Ps{0,0) — Ps{y — x,0)ds depends on the 
dimension but nothing else. 
Second, let / € Then, 



\^tix,y)\<J2\fi^)\ 

Z 



< 



/ Psix,z)-ps{y,z)ds 
Jo 

/ ps{x,z) -psiy,z)ds 
Jo 

Jo 



- \\Ps{x,-)-P8{y,-) WrVar 



< 



, [ %,y{T > s) ds 
Jo 



In the last line, we used the coupling inequality. The coupling 'Px,y is the Ornstein 
coupling, i.e., the different coordinates move independently until they meet. Since x 
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and y are equal in all but one coordinate, the probability of not having succeeded at 
time t is of order Hence we end up with 

\^t{x.y)\<c^\\f\\^Vf. 

Third, let / € l"^. This is the most interesting case. 
Lemma 4.2. Let x^y he neighbours. Then 



V / pt{x,z) -pt{y,z)dt\ <a{T) 



with 



(0{VT), d^l; 
a{T)e loilogT), d = 2; 
[o{l), d>3. 

Proof By expanding the product and using the fact that ^pt(a, z)ps{b, z) — pt+s{a, b) 

z 

Pt+s{a~ 6,0), we get 

2 



^ pt{x,z)-pt{y,z)dtj =2^ ^ i 



Pt+s (0,0)- pt+s {x~y,0)dt ds 



= 2 f f {-A)pt+s{-,0mdtds^2 f p,{0,0)-pT+s{0,0)ds 

JQ Jo Jo 

<2 [ p,(0,0)ds=:a(T). 
Jo 



Using first the Cauchy-Schwarz inequality and then Lemma 14.21 



□ 



2\ 2 



pt{x,z) - pt{y,z)dt] <||/||2a(r)" 



To conclude this example, we finally use the uniform estimates on $4 to apply Theorem 
1 and obtain 



E X exp 
E^exp 







r f{xt)dt 


< exp 


Jo 








r f{xt)dt 


< exp 


Jo 





gnu 111 

^ fc! 

. k=2 

00 II r \\h 

I k=2 
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and 



E, 



exp 



fiXt)dt~E, 



f{Xt)dt 



< exp 



T 



OO 

E 

k=2 



c^ll/l 



k\ 



IT2 



Since the generator is Af{x) — ^ J^y^xifiv) ~ '^^ use the estimates 2d times 

and divide by 2d, so no additional constants appear in the results. 

5 Application: Simple symmetric exclusion process 

This example is somewhat more involved(because of the conservation law), and shows 
the full power of our approach in the context where classical functional inequalities 
such as the log-Sobolev inequality do not hold. 

The simple symmetric exclusion process is defined via its generator 



2d 

It is known that the large deviation behaviour of the occupation time of the origin 
Jq ^t(O) dt is dependent on the dimension [5]. Its variance is of order in dimension 
d — 1, Tlog(T) in dimension d — 2 and T in dimensions d > 3 [T]. Here we will show 
the same kind of time dependence for the exponential moments, in dimension d — 1 for 
functionals of any quasi- local function /, and in dimension d > 2 for the occupation 
time of a finite set A. 

Theorem 5.1. Let f : {0,1}^ — > M 6e such that \\\ f \\\ < 00, and fix an initial 
configuration 7/0 G {0,1}^- Then 



E r,o exp ( f{r]t) dt-E^o f{rit) dt j < exp 



k=2 



and the constants Ci,C2 > are independent of f , r/o and T. 

While it is natural to assume the same kind of result in all dimensions (with a 
properly adjusted dependence on T), we can only prove it in high dimensions((i > 5, 
see application of Theorem 14.11) or for a subset of the local functions, the occupation 
indicator HAiji) := Y\ vi^) of ^ finite set A C U^, with a slightly worse dependence 

on the function(i.e. \A\). 

Theorem 5.2. Let A d he a finite, and fix an initial configuration tjq e {0, 1}^ . 

Then, for all X > 0, 

E no exp ( J XHaM dt-EnoJ XHA{vt) dtj <e 
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where a{T) € 0{T^),0{\ogT) or 0(1) in dimensions d = 1, d = 2 or d > 3. The 
constant O is independent of A, rjo and T , hut may depend on the dimension d. 

The proofs of Theorems 15.11 and 15.21 are subject of the two subsections below. For 
Theorem 1 5. 2 [ we will only look at d > 2, the case d = 1 is contained in Theorem 15. II 

5.1 Concentration of quasi-local functions \n d= I: Proof of 
Theorem 15.11 

Let / be a quasi-local function. To derive an exponential estimate, we will create a 
coupling between the exclusion process started in 77 and started in r]^^: 

Proposition 5.3. There exists a coupling P,,,,,-^!/ o/P^ and P^xh for which 

^V:V^y\]{z)^r,''t(z) ^ C\pt{x,z)-pt{y,z) I 
holds for some constant C > 0. 

Proof To couple two exclusion processes with almost identical initial conditions, we 
use a variation of the graphical representation to describe their development, which 
is the following: at each edge between two consecutive integer numbers, we put an 
independent Poissonian clock of rate 1, and whenever this clock rings we exchange 
the occupation status of the sites which are connected by the edge associated to the 
clock, which is represented by a double sided arrow. Now, to couple P^ with P , we 
instead take Poissonian clocks of rate 2, and additionally a sequence of independent 
fair coin flips associated to the arrows. For both 77^ and rf , which use the same 
arrow configuration, if the coin flip corresponding to an arrow is tails, that arrow is 
ignored, with one exception explained a bit later. First, we notice that this leads to 
effective rates of 1. Second, since we start with just two discrepancies (one at x and 
one at y), those remain the only discrepancies, and they perform independent random 
walk movements until they encounter the same arrow, which leads us to the only 
exception of the mechanics described above: When there is an arrow connecting the 
two discrepancies, the exchange of process 77^ is suppressed if the coin flip is tails, but 
then if' performs the exchange, and if the coin flip is heads, 77^ performs the exchange 
and 77^ does not. After this event, ry^ and r/^ are identical. 

If we denote the position of the discrepancies by Xt and Yf , those perform indepen- 
dent random walks of rate 1 until they meet, then they stay together. Hence 

E^,^xal^l(^)_^^2(2) ^^x,y1Xt^Yt,z<^{Xt.Yt} < C \pt{x , z) - pt{y , z) \ , 

where we used the fact that in dimension 1, the independent coupling of two random 
walks is optimal and hence 

f'x,y(Xt = Yt = z)^pt{x,z) ^pt{y,z). 

□ 
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To apply Theorem 12.61 we have to estimate 



xei.j=±i 



Stf{v''^^+n~Stf{v)dt 



xGZj = ±l V"^*' z 

<c''Y.ii ( rE'^/wi?'*(^'^)-?'*(2/,^)Ni 

<^'E E rE'5/Wi?'*(^'^)-?'*(2^'^)i^* 

• sup sup / S2Sf{z)\pt{x,z) - pt{y,z)\ dt] 



/o 

where we used Proposition 15.31 to obtain the third hne. To continue, we 



/' J WTVar 



j = ±l z 

<c\\\f \\\Vt 



Next, 



sup sup / ^Sf{z)\pt{x,z) -pt{x + j,z)\ dt 
< III / III sup sup / \pt{0,z)~ptU,z)\ dt 

j=±l z Jo 



III / III sup / pt{0,z)-pt{-l,z)dt. 

z>0 Jo 
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In order to control the supremum over z on the right hand side of the last line, let tq 
denote the first time a simple symmetric random walk (Xi)t>o hits 0. Then 



Pt(0,z)-pt(-l,z)di = / pt(O,0)dt-E 

/ Pt(0,z)dt 

T 



Pf (0, z) 



E 



TO AT 



= -1 



< E 



Pt(0,0)di 



^0 



-1 



T-roAT 

Pt(0,0)-K(-l,0)dt C< CX3. 



Hence 



< III / If VrCiC^^ 

for suitable constants C\ and C2, and Theorem 12.61 implies 



(C2 III / IIP" 



E.expU^ firjt)dt-E^ fivt)dt\<e 
for any initial configuration 77. 

5.2 Concentration of the occupation time of a finite set in d > 2: 
Proof of Theorem 15.21 

Now, we want to show that the occupation time HAirjt) dt, _ff^(r/) :— YiaeA ^('^)' 
a finite set A C Z'^ has the same time asymptotic behaviour as the occupation time of 
a single site. As a stating point to estimate i | $t 1*^ ('i'?): use the following result 
of i: 

Theorem 5.4. I2L Theorem 2.2 



aeA 



\ZHA\ 



Here Pa{Xs = Z) is the probability of exclusion walkers started in A occupying the set 
Z at time s, and pl{z) — E^7/t(z) is the occupation probability of z at time t given the 
initial configuration rj. 
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By using this comparison of exclusion dynamics with independent random walkers, 
we get 



aeA aGA 



aeA 



aeA 



aeA 



aeA 



aeA 



aeA 



\aeA aeA / zcX^ zi,Z2eZ 

\Z\ = \A\ zi^Z2 



(pf:(^i)-pn(^2))^ n pn(^3)-(p?_«(^i)-p?_.(^2))^ n p't- 



zsez 

237^21, Z2 



zsez 

235^21, 22 



Taking absolute values, we start to estimate the first difference: 



aeA aeA 



aeA aeA 

The next part is the big difference inside the integral. It is estimated by 

I (pHC^i) - pf-s{z2)f - {pUi^i) - pU{z2)? 

z^ez 

Z3^Zt_,Z2 

Now we come back to the original task of estimating L | $t j'' (•, r;). From now on, 
multiplicative constants are ignored on a regular basis, which results in an omitted 
factor of the form Cicf. However warning is given by using < instead of <. By using 
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the above estimates, we obtain the upper bound 



f dt ( ds "^AiXs^Z) P(^l'^2) 



\ 



\Z\ = \A\ 



( 



f dt f ds Y ^a{Xs^Z) Y P(^1'^2)- 



\Z\ = \A 



Z\ ,Z'2^Z 



which we will treat individually. 

For term ([5]) , we estimate sum over A by the maximum times \A\. Hence 

(© < l^l"' X X p(2:,y) ( /" \pt{x,ao)-pt{y,ao)\dt\ . 



We note that 



and 



sup / \pt{x,aa) - pt{,y,aQ)\ dt 

< sup / pt{0,0)-pt {j, 0) dt < 00 
b- 1=1 Jo 

Y ^P(^^y)[ \Ptix,ao) -ptiy,ao)\ dt\ 



2d 
<a{T) 
by Lemma 14.21 Hence 
© < |A|'=a(r), 



a;, ao + j) dt 
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Next, we must treat ([7]). In the case k = 1, 



PtZizi) - pVs{z2) ~ Pt-sizi) + p1-s{z2) 
Pt-sizi) " pVsiz2) + Pt^sizi) ~ pLAz2) 



(9a) 

(9b) 
(9c) 



Regarding the exclusion walkers Xs in ((9a| . we can simplify by using Liggett 's corre- 
lation inequality (fTj, chapter 8): 

^a{Xs = Z)= Pa{zi,Z2 e Xs) < Pa{zi e Xs)Pa{z2 e Xs) 

Z:zi .Z2^Z 



Yps{zi,a) \ iYps{z2,a) 

\aeA ) VaeA / 



Lemma 5.5. For | i | , | j | =1, 

a) For any rj, 

< \ pt{x,z) -pt{x +j,z) ~pt{x,z + i) + pt{x + j,z + i) I , 

b) J2 \Ptix,z)-pt{x+j,z)-pt{x,z + i)+pt{x+j,z + i)\<{l + t)^'^. 

Part h) holds as well when we sum over z instead of x. 
Proof First we notice that 



pr{z)-pnz) = { 



Pt{y,z) - ptix,z), r]{x) = l,T]{y) ^ 0\ 
Pt{x,z) - pt{y,z), f^{x) =0,'q{y) = 1; 
0, otherwise, 
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which immediately proves a). To show b), 

\pt{x,z) -pt{x + j,z) -pt{x,z + i) +pt{x + j,z + i) I 

= X! I ^Pt/2{X, U)pt/2{u, z) ~ Pt/2{x + j, U)pt/2iu, z) 

X u 

- Pt/2 {X, u)pt/2 {U,Z + i) +pt/2{x + j, u)pt/2 {u, Z + i) 

< I (Pt/2{x,u) -Pt/2{x+j, U)){pt/2{U, Z) - Pt/2{U, Z + i)) \ 

X u 

= ^\Pt/2{u,z) ~ Pt/2{u,Z + i)\^\pt/2{x,u) -pt/2{x + j,u)\ 

U X 

= 4 II Pt/2(0, •) - Pt/2{i, ■) Wrvar II PtM^^ " Pt/2{j, ') WrVar 

< (l+i/2)-5(l+t/2)-^ < 2{l + t)-\ 

where the last line relies on optimal coupling of two random walks, see for example 
0- □ 



As a third observation, 



^{Pt{zi,x) -pt{z2,x))r]{x) 



< \\Pt{zi,-)~Ptiz2,-)\\TVar^ 

which leads to the estimate 

m < 2 Wpt-sizi, •) - Pt-siz2, •) WrVar ^ (1 + ^ ^ s)"^- 

Applying the estimates for (l^a)) to (Pc)) . we have (for fc = 1) 



(10) 



S) 2 



<2d\A\^ ( dt ( dsp, 
Jo Jo 



(0,0)(l+t-s)- 



<|Ar/ dt ds {l + s)-2(^ + t-sy 
Jo Jo 

<\A\'a{T), 
where the last line is due to the following lemma: 
Lemma 5.6. 

i-T 



n — 1: 



dt ds (l + s)"^(l+t-s)-2 < log(l+T), n^2; 

'0 Jo , ^ Q 

n > 3. 
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Proof Write 



f{m,n):= / (1 + s)"" (1 + t - s)~ ^ rft. 
Jo 

Then / satisfies /(m, n) < {1 + t)^^{f{m — 1, n) + /(m, n — 1)) for m, n> 1: 



<ii + ty 



lo (l + s)^(l + i-s)2 

. f^(l + s)i + il + t-s)i 



■dt 



dt 



Iq {l + s)-{l + t-sy. 
= (1 + t)-^{f{jn - 1, n) + /(m, n - 1)). 

Also, /(n,0) = /(0,n) < (1 +t)5,log(l + i) or 1 for n = 1, n = 2 or n > 3. Using 
these two rules we obtain the given estimates. □ 

As we have already dealt with ([7]) when fc = 1, we use the simple fact 

X 

X X 

to generalize to any k. However, we must show that ([7]) is bounded by a constant 
when we replace the sum by the supremum. When we use the same initial estimates 
as above, we get 

sup sup / dt ds ^ ^ i ^ps{a,zi) ^^^(0,22) 

xeZ^V-^xJa Jo ,^ez..,^zAaGA / \aeA / 

■ I Pt-s(x, Zi) - Pt-s(x, Z2) - Pt-s{y, Zi) + Pt-s{y, Z2) I II Pt^s{zi, ■) - Pt-s{z2, ') WrVar ' 

and by taking the sum over zi over the pt-s differences. 



< 



dt / ds {\A\p,{Q,Q)f{l + t-s)—- 
Jo 

t 



<\Af f dt f ds il + s)-''{l+t-s)-^ <\A\^ if(i>2. 
^0 Jo 



Hence, finally, we have obtained the estimate 

O <\A\'"'a{T). 
Part ([5]) is treated in a similar way: 



Z\ = \A\ Zl#Z2 Z3#Zi,Z2 

^ X! X! bt-s(y,2:3) -Pt-s(a:^,2;3) I ||Pt-6(zi, •) -Pt~s{z2, •) I 



2 



zi Z2~zi Z3 1=1 VaeA 
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By using the fact that 

^\pt-s{x + J,z) - pt^s{x,z) I =2\\pt-s{jr)-Pt-s{0,-) Wrvar 

X 

we can sum over x to obtain another power of the total variation distance. Also, 



E E En(E?'^(«"^^)) <2d|A|=^p,(o,o), 

21 Z2~2i 23 i=l \a6A / 



hence we obtain the compound estimate 
M|^p,(0,0)(l + t-s)-3/2, 

which after integrating over s and t is again of order a{T). When we take the supremum 
over X, we can instead take the sum over on the middle term. Hence we keep another 
Ps(0,0) and we get and get 

|A|3p,(0,0)2(l + f-s)-3/2, 
which after integration is of order 1 if d > 2. Hence, 

dll) <\A\'''a{T). 
Returning to the original question, 

L\^tt{-,i^){i^)<\A t a{T) + \A\'' a{T) + | A 1^'= a{T) <\A\"' a{T), 
and after replacing < with <, 

L\^tt{:v){v)<c,^2\Ata[T). 
Now that we have this estimate, Theorem 12.61 gives us the estimate 

( fT rT \ / °° I /I l3\k' 

E^exply XHAivt)dt-E^ J XHAivt) dt j < e^p \TaiT)ci ^ 
where the constants ci and C2 do not depend on T or A. 
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